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Infection with retroviral vectors leads to 
perturbed DNA replication increasing 
vector integrations into fragile sites 
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Genome instability is a hallmark of cancer. Common fragile sites (CFSs) are specific regions in the human 
genome that are sensitive to replication stress and are prone to genomic instability in different cancer types. 
Here we molecularly cloned a new CFS, FRAllH, in llql3. The genomic region of FRAllH harbors a 
hotspot of chromosomal breakpoints found in different types of cancer, indicating that this region is 
unstable during cancer development. We further found that FRAl IH is a hotspot for integrations of Murine 
Leukemia Virus (MLV) -based vectors, following CD34+ infections in vitro as well as ex-vivo during gene 
therapy trials. Importantly, we found that the MLV-based vector infection in-vitro leads to replication 
perturbation, DNA damage and increased CFS expression. This suggests that infection by MLV-based 
vectors leads to replication-induced genome instability, raising further concerns regarding the use of 
retroviral vectors in gene therapy trials. 

Common fragile sites (CFSs) are specific regions in the human genome that appear as gaps and constrictions 
in metaphase chromosomes from cells grown under mild replication stress conditions \ These sites are 
present in all individuals and are considered to be part of the normal chromosomal structure. In vitro, 
most CFSs are induced using low concentrations of aphidicolin, an inhibitor of polymerase a, 5 and ^ Under 
these conditions, CFSs are hotspots for sister chromatid exchanges, translocations and deletions^"^. At present, 87 
common fragile sites have been characterized at low resolution by their cytogenetic appearance on metaphase 
chromosomes (http://www.ncbi.nlm.nih.gov/gene), but the exact repertoire of common fragile sites depends on 
the inducer, cell type and method of analysis. Using fluorescence in situ hybridization (FISH) 39 of the CFSs have 
been molecularly cloned, and their exact genomic localization determined^. 

In vivo, CFSs correlate with chromosomal breakpoints in tumors^'^. Recently, several studies have analyzed 
genomic deletions in cancer cells and demonstrated that CFSs are preferentially unstable in the early stages of 
cancer development and precede instability in non-fragile regions^""^^. A large scale analysis of 750 cancer cell 
lines revealed an instability signature in CFSs that differed significantly from that found in cancer genes, 
indicating that there is a unique mechanism leading to genomic instability in fragile sites^. CFSs were also found 
to be involved in the occurrence of duplications and amplifications in cancer^^"^^. The breakpoints setting the 
amplicon boundaries are within the CFSs and they drive breakage-fusion-bridge (BFB) cycles. 

Altogether, the results of these studies suggest that DSBs at fragile sites may themselves contribute to oncogen- 
esis by triggering genomic instability that can lead to disruption of tumor suppressor genes or oncogenes 
harbored within these regions. 

Another type of instability at CFSs involves the integration of foreign DNA into the genome. In vitro studies 
have shown that under replication stress conditions, plasmid DNA integrates preferentially into CFSs^^. In vivo, 
CFSs were also found to be preferential sites for integration of viral DNA. Human papillomavirus (HPV), the 
most important cancer related virus, was found to integrate preferentially into CFSs in the genome of most 
cervical cancer cells^^. Another oncogenic retrovirus found to preferentially integrate into CFSs is the Xenotropic 
Murine Leukemia related virus (XMRV)^^, a relatively new type of retrovirus associated with a subset of prostate 
tumors^". 

We have previously found preferential integrations of retroviral -derived vectors, based on Moloney Murine 
Leukemia virus (MLV), into CFSs^^ both in vitro and in gene therapy trials. Unfortunately, in two gene therapy 
trials on patients with X-linked severe combined immunodeficiency (SCID-Xl), five of the 20 treated patients 
developed monoclonal acute lymphoblastic leukemia-like lymphoproliferation. The leukemia in four of these 
patients involved activation of the LOM2 proto oncogene^^'^^, which resides within the CFS FRA11E^\ 



SCIENTIFIC REPORTS | 3 : 2 1 89 | DOI: 1 0. 1 038/srep02 1 89 



1 





FRA11H 



RP11-149G19 



RP11-15L8 



1 



RP11-70G14 

RP1 1-1456115 
RP11-890N8 



63,400,001 



)0N8 73 



77,100,000 



11q13.1 



11q13.2 



11q13.3 



11q13.4 



11q13.5 



65,206,659 
65,245,755 



67,061169 

67,214,087 

67,909,037 



71,859,43? 73,043,319 
71,859,667 73,073,670 
71,861,227 

73,144,540 



MLV integrations in SCID patients 

Figure 1 | The positional cloning of FRAllH. (a and b) Examples of FISH signals using clones from llql3 on chromosome 11 expressing FRAllH. 
(a) Right, chromosome 1 1 (stained with DAPI) expressing FRAl IH; Left, FISH signals using RPl 1-809N8 (green) and RPl 1-15L8 clones as probes (red). 
Note that the FRAl IH gap is in between the clones, (b) Right, chromosome 1 1 (stained with DAPI) expressing FRAl IH; Left, FISH signals using RPl 1- 
15L8 as a probe (red). Note that the FISH signals are on the FRAllH uncondensed region, (c) Schematic representation of chromosome llql3. The 
horizontal bar above the chromosome represents the CFS, FRAl IH region. The gray boxes represent the BACs used for the cloning of the CFS. The vertical 
lines under the chromosome represent integration sites of MLV-based vectors found in the SCID patients. The numbers indicate the genomic position of 
each integration. 



Here we studied a region in llql3 known to be a hotspot for 
chromosomal instability in many cancer types. Recurrent amplifica- 
tions in this region are found in 15% of breast carcinomas, 13% of 
lung cancers, 21% of bladder tumors, 50% of esophageal cancers and 
45% of oral squamous cell carcinomas^^. Recently, a large scale ana- 
lysis of 50 gastric tumors revealed that the llql3.3 region is among 
the most common genomic deleted regions^^. 

The llql3 band encompasses 13.7 Mb, in which a CFS FRAllH 
was characterized by low resolution cytogenetic mapping (http:// 
www.ncbi.nlm.nih.gov/gene/?term= FRAllH). However, whether 
this CFS contributes to the instability in llql3 remains unclear. 

Here we show that the recurrent instability leading to amplifica- 
tions in 1 lql3 is within the common fragile site FRAl IH. We further 
reveal that FRAllH is a preferential site for integration of MLV- 
based vectors, both in vitro and in gene therapy trials. More impor- 
tantly, using the high resolution DNA combing approach we show 
that following infection by a MLV-based vector, the replication 
dynamics of the genomic DNA is significantly perturbed, leading 
to the formation of DNA double strand breaks (DSBs) and the 
expression of fragile sites on metaphase chromosomes. These find- 
ings shed new light on viral-host interactions, and raise new concerns 
regarding the use of viral vectors in gene therapy trials. 

Results 

In order to investigate whether the breakpoints leading to the recur- 
rent amplifications in llql3 are within the CFS, FRAllH, we ana- 
lyzed constriction gaps and breaks on metaphase chromosomes from 
GM00847 cells treated with a low concentration of aphidicolin 



(0.4 |iM). Fluorescence in situ hybridization (FISH) was performed 
using BAC clones as probes. These clones flank the region of recur- 
rent genomic instability found in different cancer types in llql3 
(Supplementary Table 1). The results revealed that both BAC clones 
show FISH signals proximal, distal or on both sides of the chromatin 
constrictions and gaps in the llql3 region (Figure 1 and Table 1). 
These results indicate that a genomic region of ~7 Mb flanked by the 
analyzed probes is part of the CFS, FRAl IH, and that this fragile site 
is probably involved in chromosome instability in different cancer 
types. 

We further studied whether FRAllH is a preferential site for 
retroviral integrations. For this we first analyzed 1238 retroviral 
integration sites (RISs), mapped in cells of patients with X-linked 



Table 1 Summary of FISH results. The position of the FISH probes 
relative to the gaps and constrictions in 1 1 ql 3. The results of the 
adjacent BACs RPll -45615, RP11-707G14 and RP11-809N8 
v/ere pooled 


BAC name 


Telomeric 


On Split Centromeric 


RPl 1-15L8 


19 


3 - 3 


RPl 1-149G19 


16 


1 3 


RPl 1-456115 
RPl 1-707G14 
RPl 1-809N8 


8 


3 - 9 
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SCID (SCID-Xl) treated with a retrovirus -based gene therapy vector 
carrying the y chain (yc) gene^^'^^. Our analysis revealed 10 retroviral 
integrations into the ~7 Mb of FRAllH (Figure 1), a significantly 
higher frequency than the frequency in the entire non-fragile regions 
in the genome (binomial test p < 0.001). We then analyzed more 
than 40,000 MLV integrations into human CD34+ cells recently 
mapped by Cattoglio et al.^^. Ofthese, 2197 unique integrations were 
mapped on chromosome 11. This massive integration mapping 
allowed us to study the distribution of viral integrations into CFSs 
at a higher resolution and statistical power. First we analyzed viral 
integrations into FRAllH and compared this to the frequency of 
integrations along the entire chromosome 11 (excluding centro- 
meric, telomeric sequences and other mapped CFSs: FRAllE, 
FRAllF and FRAllG (Supplementary Table 2)). Our analysis 
revealed that the 7 Mb FRAl IH region harbors 170/2197 MLV inte- 
grations, which is significantly higher than the frequency in the 
110 Mb non- fragile regions in chromosome 11 (p = 1.63*10"^). 

Previously, we analyzed a limited number of MLV integrations 
into another fragile site on human chromosome 11, FRA11E^\ Here 
we analyzed the viral integration into FRAllE in the large integ- 
ration database mapped by Cottoglio et al.^^ The analysis revealed 
an extremely high MLV integration frequency, with 67 integrations 
into the 2 Mb of FRAl IE, significantly different from the integration 
frequency into the entire chromosome 11 (p = 2.12*10"^°). These 
results are consistent with our previous report on a limited number of 
integration sites mapped in SCID patients, which indicated that 
FRAl IH is a hotspot for MLV integrations in human CD34+ cells^\ 

We further aimed to investigate the basis for the preferential inte- 
grations of retroviral MLV-based vectors into CFSs. CFSs are sens- 
itive to replication stress conditions and under these conditions their 
chromatin is uncondensed even in mitosis. Hence, we raised the 
possibility that MLV retroviral infection leads to replication stress 
resulting in open chromatin conformation in fragile sites which 
might predispose these regions to the MLV integration protein 
complex^^. To test this possibility we infected HeLa cells with an 
MLV-based vector (based on pTG-nls-LacZ plasmid), with high 
infection efficiency (—96%) (Figure 2). Using the high resolution 
DNA combing method, which enables replication analysis of single 
DNA molecules (an example is shown in Figure 3a), we analyzed the 
DNA replication rate 24 hours following infection, and compared it 



to the rate in control cells, grown for 24 hours in a medium from 
packaging cells that does not produce virions. Our results revealed a 
significant reduction (—35%) in the replication rate following viral 
infection. The mean replication rate for the control HeLa cells was 
1.32 ± 0.04 Kb/min (n = 111), whereas the MLV based-vector 
infection significantly decreased the rate to 0.86 ± 0.02 Kb/min (n 
= 237) (p < 7.8*10"') (Figure 3b). Interestingly, following MLV 
infection the distribution of fork rates changed dramatically. In the 
control cells, in 25% of the forks the rate was >1.5 Kb/min, whereas 
only 3.3% of the forks progressed at this high rate following infection 
with the MLV-based vector. In order to verify that MLV-induced 
replication stress is a general phenomenon and is not cell- specific, 
we further analyzed the replication dynamics in normal human 
fibroblast cells expressing hTERT (BJ). As was found for HeLa cells, 
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Figure 2 | Examples of infection efficiency by MLV-based vector, 
expressing nls-Lac Z in (a) HeLa, (b) BJ cells. Immunofluorescence with 
anti-p-gal (green), nuclei were stained with DAPI (Blue) BF - bright field 
cells. 
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Figure 3 | The replication rate in HeLa and BJ cells 24 hours after 
infection with a retroviral-based vector, (a) Example of a single combed 
DNA molecule labeled with IdU (green) and CldU (red). The unstained 
gap represents the position of the origin (ORI). (b) Box plot of fork rates in 
HeLa cells infected by a retroviral-based vector (HeLa-Retro) (n = 237) 
and control cells (HeLa-control) (n = 197) (p-value = 7.8*10"^) (these 
results are a summary of two independent infections), (c) Box plot of fork 
rates in BJ cells infected by a retroviral-based vector (BJ-Retro) (n = 94) 
and control cells (BJ-control) (n = 108) (p-value = 1.188e-07); (these 
results are a summary of two independent infections). Lower hinge defines 
the 25*^ percentile, upper hinge defines the 75* percentile. The median is 
shown as a line across the box. Dimond shape indicates the average. 
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the replication rate significantly (p-value = 1.188e-07) decreased 
following the MLV-based vector infection, from an average of 2.05 
± 0.07 Kb/min (n = 108) in the control non-infected cells, to 1.55 ± 
0.06 Kb/min (n = 94) in BJ cells infected by the MLV-based vector 
(Figure 3c). Furthermore, in BJ infected by the MLV-based vector a 
dramatic (4- fold) increase in the fraction of slow forks (< 1 Kb/min) 
was found, from 3.7%, in non-infected control cells to 14.8% in the 
infected cells. The perturbed replication dynamics described in BJ 
cells is clearly an under- estimation of the phenomenon, since only 65 
± 7% of the cells were infected by the vector (performed in three 
independent experiments), as revealed by X-gal positive staining. 
Altogether, our results indicate that a short time after retroviral- 
based vector infection, the DNA replication was dramatically per- 
turbed. This replication perturbation may lead to the collapse of 
replication forks, resulting in DNA damage. 

In order to study whether the replication perturbation caused by 
retroviral infection is the result of high viral titer, leading to high 
multiplicity of infection (MOI), we infected HeLa cells with GFP 
reporter gene, by a low MOI (1 MOI) of MLV-based vectors. 24 h 
post infection, cells were sorted by fluorescence- activated cell sorting 
(FACS), to differentiate GFP positive and negative cells. Replication 
analysis using DNA combing revealed a significantly slow fork pro- 
gression rate in the GFP cells (average of 0.6 ± 0.02 Kb/min (n = 
127)), compared to the GFP negative cells (average of 1.0 ± 0.04 Kb/ 
min (n = 71)) (p-value = 1.441e-ll). 

Lentiviral -based vectors are considered as good candidate vectors 
for gene therapy trials due to their potential to mediate a prolonged 
therapeutic gene transfer, both in dividing and non- dividing cells. 
We further analyzed the effect of lentiviral infection on the replica- 
tion dynamics in HeLa cells infected with a lenti Green Fluorescent 
Protein (GFP) vector. Our results show that even with a high titer (10 
MOI), the replication fork rate remains similar to the observed rate in 
the non-infected (GFP-negative) cells (average of 1.0 ± 0.03 Kb/min 
(n = 107)) (p-value = 0.89) (Supplementary Figure 1). This result 
indicates that replication stress caused by MLV-based vectors, is not 
a general characteristic of viral infection. 

We further explored whether the replication perturbation con- 
ferred by retroviral MLV vector infection leads to fragile site express- 
ion. For this we analyzed HeLa cells 24 hours after infection with a 
retroviral -based vector. Our results show that retroviral infection 
leads to a significant (P < 0.005) level of gaps and constrictions in 
the infected cells (1.93 CFSs/metaphase) compared to non-infected 
control cells (1.19 CFSs/metaphase) (Figure 4). Importantly, half of 
the retroviral-infected cells showed a significant number of fragile 
sites (^2). This indicates that infection with a retroviral-based vector 
leads to the expression of CFSs as a result of replication stress. 

In order to investigate the possibility that the replication perturba- 
tion induced by the retroviral MLV-based vector infection leads to 
accumulation of DSBs we analyzed yH2AX foci, which are known to 
be formed at DSBs (Figure 5a)^°'^^ in HeLa cells 24 h post-infection 
with an MLV-based vector. Immunofluorescence analysis revealed 
significantly (p < 8.9*10"^) more yH2AX foci in HeLa cells 24 h 
post infection by an MLV-based vector, with an average of 2.25 ± 
0.4 foci/cell (n = 60) compared to 0.87 ± 0.2 foci/cell (n = 61) in the 
control cells (Figure 5b). Next we analyzed DSB-formation in BJ cells 
infected with an MLV-based vector. Because the infection in BJ cells 
was less efficient (only 67% of the cells were positive to the x-gal 
staining (data not shown), or the ap— gal antibody (Figure 2b)), we 
analyzed yH2AX foci only in p— gal positive cells (Figure 5c). 
Immunofluorescence analysis revealed a significant (p < 5*10"^^) 
increase in yH2AX foci in BJ cells, 24 h post- infection by the MLV- 
based vector, with an average of 3.5 ± 0.2 foci/cell (n = 105), com- 
pared to 0.95 ± 0.2 foci/cell (n = 103) in the control cells (Figure 5d). 
In order to verify that the observed effects of the MLV-based vector 
infection resulted from the viral infection and not from the intro- 
duction of foreign DNA, we analyzed the formation of DNA damage 
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Figure 4 | CFS expression in metaphase chromosomes following 
retroviral vector infection, (a) Example of metaphase chromosomes from 
HeLa cells, 24 h after infection with a retroviral-based vector. White 
arrows indicate CFSs. (b) Distribution of metaphases according to the 
number of gaps and constrictions per metaphase in HeLa cells, 24 hours 
after retroviral-based vector infection (HeLa-Retro) (n = 100) or control 
cells (HeLa-control) (n = 100) (p < 0.005); (these results are a summary of 
two independent infections). 

following tranfection with a plasmid DNA which is the backbone of 
the retroviral-based vector. As the transfection efficiency of the retro- 
viral plasmid was only >20%, we transfected HeLa cells with the 
retroviral plasmid encoding GFP (pLVC-GFP). No significant dif- 
ference (p = 0.67) in the number of yH2AX foci in the GFP positive 
cells (2.4 ± 0.3, n = 57) was found compared to the GFP negative 
cells (2.5 ± 0.2, n = 320) (Figure 6), although the transfection itself is 
toxic, resulting in DNA damage and cell death. These results indicate 
that transfection of plasmid DNA by itself does not lead to replica- 
tion-induced DSBs, indicating that such DSBs are specific to the 
infection by retroviral virons. 

In summary, our results reveal that retroviral infection leads to 
replication perturbation and DSBs suggesting a mechanism for the 
significantly higher frequency of retroviral integration into CFSs. 

Discussion 

Genome instability is a hallmark of cancer cells. One type of instab- 
ility is amplifications that increase the expression of oncogenes. The 
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Figure 5 | DSBs accumulated following retroviral-based vector infection, (a) Examples of HeLa cells infected with MLV-based vector following 
immunofluorescence with anti-(3-gal (green) and yH2AX (red). The green outlines mark positive |3-gal cells. Nuclei were stained with DAPI (blue), 
(b) Percent of nuclei with the indicated number of yH2AX foci. HeLa infected by a retroviral based vector (HeLa-Retro, n = 60), control cells (HeLa- 
control) (control, n = 60), (p < 8.9*10"^). (c) Examples of BJ cells infected with MLV-based vector following immunofluorescence with anti-(3-gal 
(green) and yH2AX (red). The green outlines mark positive |3-gal cells. Nuclei were stained with DAPI (blue), (d) Percent of nuclei with the indicated 
number of yH2AX foci. BJ cells infected by a retroviral-based vector (BJ-Retro, n = 105), control cells (BJ-control) (control, n = 103), (p < 5*10~^^); 
(these results are a summary of two independent infections). 
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Figure 6 | DSBs following transfection with a plasmid DNA encoding for GFP. (a) Examples of HeLa cells infected with MLV-based vector following 
immunofluorescence with anti-GFP (green) and yH2AX (red) . The nuclei were stained with DAPI (blue) . (b) Percent of nuclei with the indicated number 
of yH2AX foci. HeLa cells expressing GFP following transfection (HeLa-GFP, n = 57), tranfected cells that do not express GFP (HeLa-Negative) (control, 
n = 320), (p = 0.67); (these results are a summary of two independent transfections). 



high frequency of amplifications found in llql3 indicates that this 
region harbors an oncogene/s which may give a proliferative advant- 
age to the cancer cells. Indeed FRAllH harbors the oncogene 
CCNDl that encodes the cyclin D protein, which is involved in 
different types of cancer^^. However, the llql3 amplified region in 
these cancers is significantly larger than the oncogene, indicating that 
in addition to the selection force, other mechanisms contribute to 
llql3 instability. One such mechanism involves the inherent instab- 
ility of CFS under replication stress conditions. Instability in CFSs 
was shown to drive genomic amplifications by setting the boundaries 
of the amplicons. This mechanism was described for the amplifica- 
tion of different oncogenes including MET (FRA7G)^''''*''^ PIP 
(FRA7iy\ SMAD5 (FRA5C)'' among others'^'^l The preferential 
breaks promoting genomic amplifications within the replication- 
sensitive CFSs indicate that the instability in these sites involves 
replication stress conditions. Several studies have indicated that there 
is replication stress at early stages of cancer development^^'^^ which is 
a result of aberrant activation of oncogenes^"'^^'^^. Our results 
revealed a large fragile site (at least 7 Mb long), FRAllH, in 
1 lql3, which may explain the large amplifications at this site in many 
cancers. 

A different type of genome instability is the integration of foreign 
DNA into the human genome. Retroviral integrations are mediated 
by viral and cellular proteins, and are distributed along the genome 
with a preference for transcription start sites (TSS) and regulatory 
elements^^'^^'^^ and DNase I hypersensitive sites'*^. Large scale ana- 
lyses of retroviral based vectors have found preferential integration of 
viral DNA into CFSs'^''\ Here we showed that the CFS FRAl IH is a 



preferential site for MLV-based vectors. Many fragile sites are 
involved in genomic instability in cancer^^. It is important to note 
that llql3, the chromosomal band harboring FRAllH, is a known 
hotspot for genomic instability in different types of cancer 
(Supplementary Table 1), although further studies are required to 
shed light on the importance of FRAllH sequences as a hotspot for 
genomic instability in cancer. 

We further showed that in vitro infection by a retroviral -based 
vector leads to perturbed DNA replication of the host cell, as indi- 
cated by the decrease in replication rate and the increase in DNA 
damage (Figures 2 and 4). The infection also leads to CFS expression 
(Figure 4), which appears as uncondenced open chromatin at meta- 
phase chromosomes. 

The uncondensed chromatin at CFSs compared to the non-fragile 
DNA might lead to higher accessibility of these regions to the viral 
integration complex, which may account for the high number of 
MLV integrations into FRAllH and FRAl IE. Integration to the 
open chromatin of CFSs is consistent with previous results showing 
preferential MLV integration into other open chromatin regions in 
the genome, such as TSS^°, regulatory elements^^ and DNase I hyper- 
sensitive sites^^. 

Altogether, our findings showing that in vitro infection by retro- 
virus vectors leads to replication perturbations raises new concerns 
regarding the safety of these vectors. This replication perturbation 
may increase genome instability and mutation rates that could lead 
to tumorigenicity. This risk is greater especially in patients with 
challenged immune systems, such as those with SCID. Hence, future 
studies should explore the molecular basis of this replication stress. 
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and the ways in which it affects specific risks associated with gene 
therapy trials and treatments. 

Methods 

Cells and growth conditions. The simian virus 40-transformed human fibroblast cell 
line GM00847 (Coriell Cell Repository, Camden, N.J.) was grown in Eagle minimal 
essential medium supplemented with 10% fetal calf serum 100,000 U/1 penicillin 
andlOO mg/1 streptomycin. HeLa cells were grown in Dulbecco's modified Eagle's 
medium supplemented with 10% fetal calf serum, 100,000 U/1 penicillin and 100 mg/1 
streptomycin. 

Retroviral-based vector infection and tranfection. For the MLV-based vector 
infection, we used packaging cells stably expressing a pseudo-type gibbon ape 
leukemia virus (GALV) envelope (TGA), and the pTG-nls-LacZ plasmid which is a 
MLV-based vector (a gift from Prof Moshe Y. Flugelman, Technion, Israel), and from 
Prof. Bertrand Boson (Universite de Lyon, Lyon, France.)^^. 

Evaluation of the LacZ gene transfer to HeLa cells was performed by X-GAL 
staining. 

For non- viral expression of GFP pLVC-GFP plasmid, we transfected cells using the 
TransIT-HeLaMONSTER Transfection Kit (Mirus Bio LLC). 

Preparation of cells for cytogenetic analysis. Cells infected with the MLV-based 
vector were grown in 2.2 mM caffeine (Sigma-Aldrich) for 24 h before fixation in 
order to avoid cell cycle arrest conferred by replication-induced DNA damage during 
the S-phase. On the day of fixation, cells were grown with 0.1 ug/ml Colcemid for 40 
minutes, trypsinized and incubated with 0.4% KCl in 37°C for 40 minutes. Cells were 
fixed with methanol- acetic acid (3:1) and spread by dropping on cover slips. 

Fluorescent in situ hybridization (FISH). DNA BAC clones were labeled with 
digoxigenin DY-505-aadUTP (Applied Spectral Imaging) by nick translation. FISH 
on metaphase chromosomes was performed as previously described^^. 

Cytogenetic analysis of hybridization signals and fragile site expression. Green 
and red fluorescence were visualized using a Nikon B-2A filter cube. For weak signals 
a modified Chromatech HQ-FITC (Chroma Technology, Brattleboro, Vt.) filter set 
was used (excitation, 460 to 500 nm; emission, 520 to 600 nm). Images were captured 
with an intensified charge -coupled device imager (Paultek Imaging, Grass Valley, 
Calif) and digitized with a frame grabber (Imascan/MONO-D; Imagraph, 
Chelmsford, Mass.). The Image-Pro PLUS program (Media Cybernetics, Silver 
Spring, Md.) was used to measure the fragile site-telomere distance relative to the total 
length of the p arm of chromosome 1 1 and compared it to the GDB mapping of the 
fragile sites, as previously described"^^. 

Molecular combing. Unsynchronized cells were pulse labeled for 30 min by a 
medium containing 100 mM of the thymidine analog iododeoxyuridine (IdU). At the 
end of the first labeling period, the cells were washed twice with a warm medium and 
pulse labeled once more for 30 min with a medium containing 100 mM of another 
thymidine analog, chlorodeoxyuridine (CldU). Cells were then harvested, and genomic 
DNA was extracted, combed, and analyzed as previously described'^^'*^. The primary 
antibody for fluorescence detection of IdU was mouse anti-BrdU (Becton Dickinson), 
and the secondary antibody was goat anti-mouse Alexa Fluor 488 (Invitrogen). The 
primary antibody for fluorescence detection of CldU was rat anti-CldU (Novus 
Biologicals). The secondary antibody was goat anti-rat Alexa Fluor 594 (Invitrogen). 
The length of the replication signals was measured in micrometers and converted to 
kilo bases according to a constant and sequence-independent stretching factor (1 mm 
= 2 Kb), as previously reported"^*^. 

Statistical analysis. A one-tailed t-test was performed for the replication rate analysis, 
yH2AX foci and chromosomal CFS expression. A binomial test was used to examine 
the frequency of MLV integrations into CFSs, compared to non-fragile regions. 
Statistical analysis was performed on at least two biological repeats. 

Immunofluorescence for detection of yH2AX foci and Pgal. Cells were fixed in 
3.7% formaldehyde/PBS for 10 min, permeabilized with 0.5% Triton/PBS, and 
blocked with 5% BSA/PBS. The primary yH2AX antibodies used were mouse anti- 
YH2AX (Upstate Biotechnology). Appropriate secondary antibodies were added, Cy3 
conjugated (Jackson ImmunoResearch Laboratories) or Alexa Fluor 594 (Invitrogen). 
For P-gal analysis we used anti- P-Galactosidase (Promega) as a primary antibodies, 
and Alexa Fluor 488 (Invitrogen) as a secondary antibodies. Images were taken with a 
Bio-Rad confocal microscope. For focus information analysis at least 50 nuclei for 
each condition were analyzed. 
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